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DNA construct and its use. 
The present invention relates to a new DNA construct for transformation into oilseed 
plants. The DNA construct comprises nucleotide sequences encoding peptides with enzyme 
5 activities necessary for the high-level production and esterification of keto group-containing 
xanthophylls in oilseed plants. 

Background of the invention 

Carotenoids are produced de novo by plants, fungi, algae and some bacteria. A 
number of biosynthetic steps are needed for the biological production of the carotenoids. 
10 There are two chemically different groups of carotenoids, namely carotenes containing only 
carbon and hydrogen molecules and xanthophylls containing oxygen in the molecule in 
addition to carbon and hydrogen. 

The xanthophylls, and particularly astaxantWn (3,3'-dihydroxy-p-p-carotene-4,4'- 
dione), are often colored pigments and are used as such or as anti-oxidants. 
15 Carotenes are biological precursors for the production of the oxygen-containing 

xanthophylls. There are two types of enzymes responsible for the introduction of hydroxy 
groups and keto groups into the carotenes, namely hydroxylases and ketolases, respectively. 

The keto group-containing xanthophyll astaxanthin, which has keto and hydroxy 
groups, is biosynthetically produced from beta-carotene. 
20 Large-scale production of xanthophylles from natural sources is at present performed 

by AstaCarotene AB, Gustavsberg, Sweden, by cultivation of the alga Haematococcus 
pluvialis for the production of astaxanthin in esterified form. 

It would be desirable to be able to produce keto group-containing xanthophylls 
particularly astaxanthin 3 in oilseed plants. Oilseed plants have naturally p-carotene 
25 hydroxylases but lack p-carotene C-4-oxygenase enzymes or ketolases. 

Description of the invention 
The present invention provides DNA constructs enabling and promoting 
production of keto group containing xanthophylls, especially astaxanthin, in oilseed plants, 
such as rape, sunflower, soybean and mustard. The DNA construct is transformed into the 
30 oilseed plant cell for expression of a protein or fused protein which has an enzyme activity 
enabling keto group insertion into a carotene or hydroxy carotene for the biosynthetic 
production of a keto group containing xanthophyll, such as cantaxanthin (P,P-carotene-4,4*- 
dione) and/or astaxanthin. Use is thus made of the biosynthetic pathway of the oilseed plant to 
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produce carotenoids. The naturally occurring synthesis of carotenoids involves a number of 
enzymes, namely 1-D-deoxyxylulose 5-phosphate synthase, isopentenyl 
pyrophosphaterdimethylallyl pyrophosphate isomerase, geranylgeranyl pyrophosphate 
synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, lycopene beta- 
5 cyclase, p-carotene hydroxylase, and p-carotene C-4-oxygenase. Genes coding for peptides 
having these enzymatic activities may be inserted into the DNA construct of the invention, 
one or several per construct, to promote high-level production in the transgenic oilseed plant. 
In case only one enzyme coding gene is inserted per plant, two or more plants may be 
sexually interbred to produce plants containing all the desired enzyme activities. 

10 Thus, the present invention is directed to a DNA construct comprising in the 5' to 3' 

direction of transcription operably linked a promoter region directing transcription to the seed 
of an oilseed plant, a nucleotide sequence coding for at least one peptide with enzyme activity 
necessary for keto group containing xanthophyll production and esterification in an oilseed 
plant and a transcriptional termination region. 

15 In a preferred embodiment of the invention the DNA construct additionally 

comprises between the promoter region and the nucleotide sequence coding for at least one 
peptide with enzyme activity a nucleotide sequence coding for a transit peptide directing the 
translated fusion polypeptide to the chloroplast of the oilseed plant. 

The DNA construct is preferably such that the promoter is a napin promoter, the 

20 peptide with enzyme activity necessary for keto group containing xanthophyll production is 
selected from the group consisting of peptides with l-D-deoxyxylulose 5-phosphate synthase, 
isopentenyl pyrophosphate:dimethylallyl pyrophosphate isomerase, geranylgeranyl 
pyrophosphate synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, 
lycopene beta-cyclase, p-carotene hydroxylase, and P-carotene C-4-oxygenase activity. To 

25 promote esterification of astaxanthin a nucleotide sequence coding for a peptide with acyl 
transferase activity may be included in the group. 

In a preferred embodiment of the DNA construct according to the invention the 
nucleotide sequence coding for a peptide with enzyme activity is a nucleotide sequence 
coding for a N-terminally truncated P-carotene C-4-oxygenase gene from the alga 

30 Haematococcus pluvialis. 

An example of the DNA construct of the invention is presented in the sequence 
listing as SEQ ID NO: 1 and in Fig. 1 . 
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The present invention is also directed to a transgenic oilseed plant cell 
comprising the DNA construct of the invention, and preferably the oilseed plant is selected 
from the group consisting of rape, sunflower, soybean and mustard. 

The invention is additionally directed to transgenic oilseed plant-produced 
5 xanthophyll, e.g. canthaxanthin and astaxanthin. 

A preferred aspect of the invention is directed to transgenic oilseed plant- 
produced astaxanthin esters. 

The present invention will now be illustrated with reference to the DNA 
construct disclosed in the sequence listing and in Fig.l, and the following description of 
10 embodiments. However, the invention is not limited to these exemplifications. 

Short description of the drawings 
Fig.l illustrates the nucleotide sequence of the DNA construct comprising the napin promoter, 
the chloroplast localization signal, the N-terminally truncated P-carotene C-4-oxygenase gene 
and the termination sequence, and the deduced amino acid sequences of the transit peptide 
15 and the P-carotene C-4-oxygenase. 

Description of embodiments 
The invention is illustrated by production of astaxanthin in the seed of oilseed 
rape. The astaxanthin produced in the seed of the transgenic plant is extracted as part of the 
extracted oil. By use of conventionally used protocols for Agrobacterium tumefaciens 
20 mediated transformation such as described by (Hoekema et all 983, An et al. 1986, Fry et al. 
1987, DeBlock et al 1988, Radke et al.1988, or Moloney et al. 1989) transgenic plants are 
produced having a chimeric DNA construct that is genetically inherited and is able to produce 
astaxanthin. The nucleotide sequence of the chimeric DNA construct consist of four parts of 
different genetic origin namely: (1) a promoter, (2) a localization signal, (3) a P-carotene C-4- 
25 oxygenase coding region and (4) a termination sequence. 

The napin promoter directs transcription to the seed of oilseed rape (Stalberg et 
al 1996). This promoter was coupled to a localization signal similar but not identical to a 
transit peptide (TP) of Rbcsla (Krebbers, 1988) that directs the translated product of a fused 
gene to the chloroplast. The promoter and the TP sequence were ligated to a part of the coding 
30 sequence of a ketolase gene BCK (Kajiwara et al. 1 995). This enzyme oxygenates p-carotene 
to canthaxanthin, (Fraser et al. 1997). The chimeric DNA construct was then coupled to a 
suitable termination sequence, e.g. that of the Agrobacterium tumefaciens nopaline synthase 
gene (the nos 3' end)(Bevan et al. 1983), as illustrated in Fig.l. 
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Cellular storage of Astaxamin 

The storage of large amounts of free astaxanthin in plants will be difficult due to 
toxic effects of the molecule as it intercalates in the plant membranes. An effective 
esterification of astaxanthin to fatty acids enables storage of the cstcrified molecules in 
triacylglycerol containing oleosomes. Thus, an acyl transferase can be claimed to be of 
fundamental importance for the process, as is proteins that can mediate transport of different 
forms of astaxanthin from the chloroplast to the vesicles. 
Sequences and oligonucleotides used in the construction of the DNA construct 
7. Napin promoter (GeneBank ACCESSION No. J02798) 

This promoter sequence, a 1 145 base pair fragment including the 5' leader 
sequence has a unique Hindlll site at the 5' end. The 3' end was synthesized with an 
additionally 6 nucleotide BamHI site. 

2. Transit peptide similar to RBCSla (GeneBank ACCESSION No. XI 3611 X14565) 

The transit peptide (TP) was amplified by PCR from -28 to the end of the transit 
cleavage aa=54/55 site of the Rbcsla gene. The 5* end was synthesized with a BamHI site 
and similarly the 3' sequence was synthesized with a Xbal site. The two following 
oligonucleotides were used for the PCR amplification. 

BamHI 

5 9 primer: TP 1 5 ' AGAC GGATCC TC AGTCACACAAAGAGTA 3 ' 

Sad Xbal 

3 ' primer: TP2 5 'GTTC GAGCTC TCTAGA CATGCAGTTAACGC 3 ' 

3. BCK (ficarotene C-4 oxygenase) (Genebank ACCESSION No. D45881) 

The BCK fragment was amplified by PCR including a 5' Xbal site and was 
ligated to the TP already described. The 5' primer (BCK1) used for PCR, is homologous to 
the BCK sequence from nucleotide 264 and the 3' oligonucleotide (Ax40) ends with a stop 
codon and was synthesized with a Sad restriction site for cloning. The synthesized fragment 
was fused to the TP as shown in Fig I. 
Oligonucleotides used for PCR: 

Xbal 

5 ' primer BCK1 5 'AC AG TCTAGA ATGCCATCCGAGTCGTCA 3 ' 

Sad 

3 'primer: AX40 5 'CACCGAGCTCCATGAC ACTCTTGTGCAGA 3 ' 
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15 



Description of SEQ ID NO:l and SEQ ID NO:2 
The sequences shown i Fig. I are the same as the two sequences which are 
shown in the sequence listing. 

The SEQ ID NO:l is a nucleotide sequence composed of the following features: 

Nucleotide No. 
Cloning site Hindlll 1 -6 

Napin Promoter 1-1145 
Cloning site BamHI 1146-1151 
Transit peptide leader 1152-1178 
Transit peptide coding 1 1 79- 1 347 

Cloning site Xbal 1348-1353 
p-carotene C-4-oxygenase 1354-2217 
P-carotene C-4-oxygense 3' untranslated 2218-2266 
Cloning site Sad 2267-2272 
Nopaline synthetase termination 2273-2536 
Cloning site EcoRI 2538-2543 



the 



The SEQ ID NO: 2 is a deduced amino acid sequence of the fusion protein of 
transit peptide and the peptide with P-carotene C-4-oxygenase activity. 
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Claims 

1. A DNA construct comprising in the 5' to 3' direction of transcription operably 
linked a promoter region directing transcription to the seed of an oilseed plant, a nucleotide 
sequence coding for at least one peptide with enzyme activity necessary for keto group 

5 containing xanthophyll production and esterification in an oilseed plant and a transcriptional 
termination region. 

2. The DNA construct according to claim 1 , which between the promoter region 
and the nucleotide sequence coding for at least one peptide with enzyme activity additionally 
comprises a nucleotide sequence coding for a transit peptide directing the translated fusion 

10 polypeptide to the chloroplast of the oilseed plant. 

3. The DNA construct according to claim 1 or 2, wherein the promoter is a napin 
promoter, the peptide with enzyme activity necessary for keto group containing xanthophyll 
production and esterification is selected from the group consisting of peptides with, 1-D- 
deoxyxylulose 5-phosphate synthase, isopentenyl pyrophosphaterdimethylallyl pyrophosphate 

15 isomerase, geranylgeranyl pyrophosphate synthase, phytoene synthase, phytoene desaturase, 
zeta-carotene desaturase, lycopene beta-cyclase, P-carotene hydroxylase, P-carotene C-4- 
oxygenase, and acyl transferase activity. 

4. The DNA construct according to any one of claims 1 - 3, wherein the 
nucleotide sequence coding for a peptide with enzyme activity is a nucleotide sequence 

20 coding for a N-terminally truncated P-carotene C-4-oxygenase gene from the alga 
Haematococcus pluvialis. 

5. The DNA construct according to claim 4, wherein the nucleotide sequence is 
SEQ ID NO:l. 

6. Transgenic oilseed plant cell comprising the DNA construct of any one of 

25 claims 1-5 . 

7. Transgenic oilseed plant cell according to claim 6, wherein the oilseed plant is 
selected from the group consisting of rape, sunflower, soybean and mustard. 

8. Transgenic oilseed plant-produced xanthophyll. 

9. Transgenic oilseed plant-produced xanthophyll according to claim 8, wherein 
30 the xanthophyll is canthaxanthin 

10. Transgenic oilseed plant-produced xanthophyll according to claim 8, 
wherein the xanthophyll is astaxanthin. 

11. Transgenic oilseed plant-produced xanthophyll according to claim 8, 
wherein the xanthophyll is astaxanthin esters. 
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Napin promoter 

AAGCTTTCTTCATCGGTGATTGATTCCTTTAAAGACTTATGTTTCTTATCTTGCTTCTGA 

GGCAAGTATTCAGTTACCAGTTACCACTTATATTCTGGACTTTCTGACTGCATCCTCATT 

TTTC C AACATTTTAAATTTCACTATTGGCTGAAT GCTTCTT CTTTGAGG AAGAAACAATT 

C AGATGGCAGAAATGTATCAACCAATGCATAT AT ACAAATGTACCTCTT GTTCT CAAAAC 

ATCTATCGGATGGTTCCATTTGCTTTGTCATCCAATTAGTGACTACTTTATATTATTCAC 

TCCTCTTTATTACTATTTTCATGCGAGGTTGCCATGTACATTATATTTGTAAGGATTGAC 

GCTATTGAGCGTTTTTCTTCAATTTTCTTTATTTTAGACATGGGTATGAAATGTGTGTTA 

GAGTTGGGTTGAATGAGATAT ACGTTCAAGTGAAGTGGCAT ACCGTT CT CGAGTAAGGAT 

GACCTACCCATTCTTGAGACAAATGTTACATTTTAGTATCAGAGTAAAATGTGTACCTAT 

AACTCMATTCGATTGACATGTATCCATTCAACATAAAATTAAACCAGCCTGCACCTGCA 

TCCACATTTCAAGTATTTTCAAACCGTTCGGCTCCTATCCACCGGGTGTAACAAGACGGA 

TTCC GAATTTGGAAGATTTTGACTCAAATTCC CAATTT ATATTGAC CGTGACTAAATCAA 

CTTTAACTTCTATAATTCTGATTAAGCTCCCAATTTATATTCCCAACGGCACTACCTCCA 

AAATTTATAGACTCTCATCCCCTTTTAAACCAACTTAGTAAACGTTTTTTTTTTTAATTT 

TATGAAGTTAAGTTTTT ACCTTGTTTTTAAAAAGAATCGTTC ATAAGAT G C CATGCCAGA 

ACATTAGCTACACGTTACACATAGCATGCAGCCGCGGAGAATTGTTTTTCTTCGCCACTT 

GTCACTCCCTTCAAACACCTAAGAGCTTCTCTCTCACAGCACACACATACAATCACATGC 

GTGCATGCATTATTACACGTGATCGCCATGCAAATCTCCTTTATAGCCTATAAATTAACT 

CATCCGCTTCACT CTTTACTCAAACCAAAACT CAT C AATACAAAC AAGATTAAAAAGAT A 

End -2 8 untranslated leader TP start 

CACGAGGATCCTCAGTCACACAAAGAGTAAAGAAGAACAATGGCTTCCTCTATGCTCTCT 

M A S S M L S 

TCCGCTACTATGGTTGCCTCTCCGGCTCAGGCCACTATGGTCGCTCCTTTCAACGGACTT 
SATMVAS PAQATMVAPFNGL 

AAGTCCTCCGCTGCCTTCCCAGCCACCCGCAAGGCTAACAACGACATTACTTCCATCACA 
KSSAAFPATRKANNDITSIT 



FIG.l 
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TP End C-4 -Oxygenase 

AGCAACGGCGGACGCGTTAACTGCATGTCTAGAATGCCATCCGAGTCGTCAGACGCAGCT 
SNGGRVNCMSRMPSES SDAA 

CGTCCTGCGCTAAAGCACGCCTACAAACCTCCAGCATCTGACGCCAAGGGCATCACGATG 
RPALKHAYKPPASDAKGITM 

GCGCTGACCATCATTGGCACCTGGACCGCAGTGTTTTTACACGCAATATTTCAAATCAGG 
ALTI IGTWTAVFLHAI F Q I R 

CTACCGACATCCATGGACCAGCTTCACTGGTTGCCTGTGTCCGAAGCCACAGCCCAGCTT 
LPTSMDQLHWLPVSEATAQL 

TTGGGCGGAAGCAGCAGCCTACTGCACATCGCTGCAGTCTTCATTGTACTTGAGTTCCTG 
LGGSSSLLHIAAVFIVLEFL 

TACACTGGTCTATTCATCACCACACATGACGCAATGCATGGCACCATAGCTTTGAGGCAC 
YTGLFI TTHDAMHGTI ALRH 

AGGCAGCTCAATGATCTCCTTGGCAACATCTGCATATCACTGTACGCCTGGTTTGACTAC 
RQLNDLLGNICI SLYAWFDY 

AGCATGCTGCATCGCAAGCACTGGGAGCACCACAACCATACTGGCGAAGTGGGGAAAGAC 
SMLHRKHWEHHNHTGEVGKD 

CCTGACTTCCACAAGGGAAATCCCGGCCTTGTCCCCTGGTTCGCCAGCTTCATGTCCAGC 
PDFHKGNPGLVPWFAS FMSS 

TACATGTCCCTGTGGCAGTTTGCCCGGCTGGCATGGTGGGCAGTGGTGATGCAAATGCTG 
YMSLWQF ARLAWWAVVMQML 

GGGGCGCCCATGGCAAATCTCCTAGTCTTCATGGCTGCAGCCCCAATCTTGTCAGCATTC 
GAPMANLLVFMAAAPILSAF 

CGCCTCTTCTACTTCGGCACTTACCTGCCACACAAGCCTGAGCCAGGCCCTGCAGCAGGC 
RLFY FGTYLPHKP EP G P A A G 

TCTCAGGTGATGGCCTGGTTCAGGGCCAAGACAAGTGAGGCATCTGATGTGATGAGTTTC 
SQVMAWFRAKTSEASDVMSF 

CTGACATGCTACCACTTTGACCTGCACTGGGAGCACCACAGATGGCCCTTTGCCCCCTGG 
LTCYHFDLHWEHHRWPFAPW 

C-4 oxygenase Stop 
TGGCAGCTGCCCCACTGCCGCCGCCTGTCCGGGCGTGGCCTGGTGCCTGCCTTGGCATGA 
WQLPHCRRLSGRGLVPALA* 



FIG.l (cont . ) 
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C-4 oxygenase untranslated region Kos term 

CCTGGTCCCTCCGCTGGTGACCCAGCGTCTGCACAAGAGTGTCATGGAGCTCQAATTTCC 

CCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCCGGTCTTG 

CGATGATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAATAATTAACATGTAAT 

GCATGAC GTT ATTTATG AGATGGGTTTTTATG ATT AGAGT CCCG CAATT AT ACATTT AAT 

ACGCGAT AGAAAACAAAAT ATAGCGCGCAAACT AGGATAAATTATCGC GCG CGGTGTCAT 
end 

CTATGTTACTAGATCGGGAATTC 



Fig. 1 (cont . ) 
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SEQUENCE LISTING 



<110> AstaCarotene AB 



<12 0> DNA construct and its use 

<130> 29295 -AstaCarotene 

<140> 
<14I> 

<160> 2 

<170> Patentln Ver. 2.1 
<210> l 

<211> 2543 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: napin promoter 

+ choroplast localization signal + beta-carotene C-4 oxygenase 
coding sequence + termination sequence 

<220> 

<221> promoter 
<222> (1) . . (1145) 

<220> 

<221> trans it_pep tide 
<222> (1179) . . (1347) 

<220> 
<221> CDS 

<222> (1179) . . (2217) 
<220> 

<2 2l> terminator 
<222> (2273) . . (2536) 

<400> 1 



aagctttctt 


catcggtgat 


tgattccttt 


aaagacttat 


gtttcttatc 


ttgcttctga 


60 


ggcaagtatt 


cagttaccag 


ttaccactta 


tattctggac 


tttctgactg 


catcctcatt 


120 


tttccaacat 


tttaaatttc 


actattggct 


gaatgcttct 


tctttgagga 


agaaacaatt 


180 


cagatggcag 


aaatgtatca accaatgcat 


atatacaaat 


gtacctcttg 


ttctcaaaac 


240 


atctatcgga 


tggttccatt 


tgctttgtca 


tccaattagt 


gactacttta 


tattattcac 


300 


tcctctttat 


tactattttc 


atgcgaggtt 


gccatgtaca 


ttatatttgt 


aaggattgac 


360 


gctattgagc 


gtttttcttc 


aattttcttt 


attttagaca 


tgggtatgaa 


atgtgtgtta 


420 


gagttgggtt 


gaatgagata 


tacgttcaag 


tgaagtggca 


taccgttctc 


gagtaaggat 


480 


gacctaccca 


ttcttgagac 


aaatgttaca 


ttttagtatc 


agagtaaaat 


gtgtacctat 


540 



WO 01/20011 



2 



PCT/SE00/01767 



aactcaaatt cgattgacat gtatccattc aacataaaat taaaccagcc tgcacctgca 600 

tccacatttc aagtattttc aaaccgttcg gctcctatcc accgggtgta acaagacgga 660 

ttccgaattt ggaagatttt gactcaaatt cccaatttac attgaccgtg actaaatcaa 720 

ctttaacttc tataattctg attaagctcc caatttatat tcccaacggc actacctcca 7 80 

aaatttatag actctcatcc ccttttaaac caacttagta aacgtttttt tttttaattt 840 

tatgaagtta agtttttacc ttgtttttaa aaagaatcgt tcataagatg ccatgccaga 900 

acattagcta cacgttacac atagcatgca gccgcggaga attgtttttc ttcgccactt 960 

gtcactccct tcaaacacct aagagcttct ctctcacagc acacacatac aatcacatgc 1020 

gtgcatgcat tattacacgt gatcgccatg caaatctcct ttatagccta taaattaact 1080 

catccgcttc actctttact caaaccaaaa ctcatcaata caaacaagat taaaaacata 1140 

cacgaggatc ctcagtcaca caaagagtaa agaagaaca atg get tec tct atg 1194 

Met Ala Ser Ser Met 
1 5 

etc tct tec get act atg gtt gee tct ccg get cag gee act atg gtc 1242 
Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala Gin Ala Thr Met Val 
10 15 20 

get cct ttc aac gga ctt aag tec tec get gee ttc cca gee acc cgc 1290 
Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala Phe Pro Ala Thr Arg 
25 30 35 

aag get aac aac gac att act tec ate aca age aac ggc gga cgc gtt 133 8 
Lys Ala Asn Asn Asp lie Thr Ser lie Thr Ser Asn Gly Gly Arg Val 
40 45 50 

aac tgc atg tct aga atg cca tec gag teg tea gac gca get cgt cct 13 86 
Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro 
55 60 65 

gcg eta aag cac gee tac aaa cct cca gca tct gac gee aag ggc ate 14 34 
Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly He 
70 75 80 85 

acg atg gcg ctg acc ate att ggc acc tgg acc gca gtg ttt tta cac 14 82 
Thr Met Ala Leu Thr lie lie Gly Thr Trp Thr Ala Val Phe Leu His 
90 95 100 

gca ata ttt caa ate agg eta ccg aca tec atg gac cag ctt cac tgg 1530 
Ala He Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin Leu His Trp 
105 110 115 

ttg cct gtg tec gaa gec aca gee cag ctt ttg ggc gga age age age 1578 
Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser 
120 125 130 

eta ctg cac ate get gca gtc ttc att gta ctt gag ttc ctg tac act 1626 
Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr Thr 
135 140 145 
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ggt eta ttc ate acc aca cat gac gca atg cat ggc acc ata get ttg 1674 
Gly Leu Phe lie Thr Thr His Asp Ala Met His Gly Thr He Ala Leu 
150 155 160 165 

agg cac agg cag etc aat gat etc ctt ggc aac ate tgc ata tea ctg 1722 
Arg Hie Arg Gin Leu Asn Asp Leu Leu Gly Asn He Cys He ser Leu 

170 175 160 

tac gec tgg ttt gac tac age atg ctg cat cgc aag cac tgg gag cac 1770 
Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg Lys His Trp Glu His 
185 190 195 y 

cac aac cat act ggc gaa gtg ggg aaa gac cct gac ttc cac aag gga 1818 
His Asn His Thr Gly Glu Val Gly Lys Asp Pro Asp Phe His Lys Gly 
200 205 210 

aat ccc ggc ctt gtc ccc tgg ttc gee age ttc atg tec age tac atg 1866 
Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met 
215 220 225 

tec ctg tgg cag ttt gee egg ctg gca tgg tgg gca gtg gtg atg caa 1914 
Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala Val Val Met Gin 
230 235 240 245 

atg ctg ggg gcg ccc atg gca aat etc eta gtc ttc atg get gca gee 1962 
Met Leu' Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala 
250 255 260 

cca ate ttg tea gca ttc cgc etc ttc tac ttc ggc act tac ctg cca 2010 
Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro 
265 270 275 

cac aag cct gag cca ggc cct gca gca ggc tct cag gtg atg gee tgg 2058 
His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp 
280 285 290 

ttc agg gec aag aca agt gag gca tct gat gtg atg agt ttc ctg aca 2106 
Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr 
295 300 305 

tgc tac cac ttt gac ctg cac tgg gag cac cac aga tgg ccc ttt gec 2154 
Cys Tyr His Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala 
310 315 320 325 

ccc tgg tgg cag ctg ccc cac tgc cgc cgc ctg tec ggg cgt ggc ctg 2202 
Pro Trp Trp Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu 
330 335 340 

gtg cct gec ttg gca tgacctggtc cctccgctgg tgacccagcg tetgeacaag 2257 
Val Pro Ala Leu Ala 
345 

agtgtcatgg agctcgaatt tccccgatcg ttcaaacatt tggcaataaa gtttcttaag 2317 

attgaatcct gttgccggtc ttgegatgat tatcatataa tttctgttga attacgttaa 2377 

gcatgtaata attaacatgt aatgeatgae gttatttatg agatgggttt ttatgattag 2437 

agtcccgcaa ttatacattt aatacgegat agaaaacaaa atatagegeg caaactagga 2497 

taaattatcg cgcgcggtgt catctatgtt actagategg gaattc 2543 
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<210> 2 
<211> 346 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: deduced fusion protein of 

transit peptide + peptide with beta-carotene C-4 oxygenase activity 

<400> 2 

Met Ala Ser Ser Met Leu Ser Ser Ala Thr Met Val Ala Ser Pro Ala 

15 10 15 

Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp lie Thr Ser lie Thr Ser 

35 40 45 

Asn Gly Gly Arg Val Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser 
50 55 60 

Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser 
65 70 75 80 

Asp Ala Lys Gly He Thr Met Ala Leu Thr He He Gly Thr Trp Thr 
85 90 95 

Ala Val Phe Leu His Ala He Phe Gin He Arg Leu Pro Thr Ser Met 
100 105 110 

Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu 
115 120 125 

Gly Gly Ser Ser Ser Leu Leu His He Ala Ala Val Phe He Val Leu 
130 135 140 

Glu Phe Leu Tyr Thr Gly Leu Phe He Thr Thr His Asp Ala Met His 
145 150 155 160 

Gly Thr He Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn 
165 170 175 

He Cys He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg 
180 185 190 

Lys His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro 
195 200 205 

Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe 
210 215 220 

Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp 
225 230 235 240 



Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val 
245 250 255 
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Phe Met Ala Ala Ala Pro He Leu 
260 

Gly Thr Tyr Leu Pro His Lys Pro 
275 280 

Gin Val Met Ala Trp Phe Arg Ala 
290 295 

Met Ser Phe Leu Thr Cys Tyr His 
305 310 

Arg Trp Pro Phe Ala Pro Trp Trp 
325 

Ser Gly Arg Gly Leu Val Pro Ala 
340 



Ser Ala Phe Arg Leu Phe Tyr Phe 
265 270 

Glu Pro Gly Pro Ala Ala Gly Ser 
285 

Lys Thr Ser Glu Ala Ser Asp Val 
300 

Phe Asp Leu His Trp Glu His His 
315 320 

Gin Leu Pro His Cys Arg Arg Leu 
330 335 

Leu Ala 
345 
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